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Abstract 

Background: Phage-encoded serine integrases, such as cpC31 integrase, are widely used for genome engineering. Fifteen 
such integrases have been described but their utility for genome engineering has not been compared in uniform assays. 

Results: We have compared fifteen serine integrases for their utility for DNA manipulations in mammalian cells after first 
demonstrating that all were functional in £ coli. Chromosomal recombination reporters were used to show that seven 
integrases were active on chromosomally integrated DNA in human fibroblasts and mouse embryonic stem cells. Five of 
the remaining eight enzymes were active on extra-chromosomal substrates thereby demonstrating that the ability to 
mediate extra-chromosomal recombination is no guide to ability to mediate site-specific recombination on integrated DNA. 
All the integrases that were active on integrated DNA also promoted DNA integration reactions that were not mediated 
through conservative site-specific recombination or damaged the recombination sites but the extent of these aberrant 
reactions varied over at least an order of magnitude. Bxbl integrase yielded approximately two-fold more recombinants and 
displayed about two fold less damage to the recombination sites than the next best recombinase; cpC31 integrase. 

Conclusions: We conclude that the Bxbl and cpC31 integrases are the reagents of choice for genome engineering in 
vertebrate cells and that DNA damage repair is a major limitation upon the utility of this class of site-specific recombinase. 
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Background 

Serine integrases are phage-encoded site-specific recombinases 
that promote conservative recombination reactions be- 
tween short (40-60 bp) DNA substrates located on the 
phage (phage attachment site, attP) and bacterial (bacter- 
ial attachment site, attB) chromosomes [1]. The product 
of attP x attB recombination is an integrated prophage 
flanked by two new recombination sites, attL and attR, 
each containing half sites derived from attP and attB. In 
the absence of accessory factors the integrases mediate 
unidirectional recombination between attP and attB 
with greater than 80% efficiency. In the presence of a 
phage-encoded accessory protein, the recombination 
directionality factor (RDF) the attP x attB recombin- 
ation is inhibited and the attL x attR recombination is 
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stimulated [2,3]. In this way integration (attP x attB) 
and excision (attL x attR) of the phage genome are 
under strict controls and in tune with the phage life 
cycles. The unidirectional activity, short substrate sites 
and functional autonomy of these recombinases has 
made them a useful complement to the widely used re- 
versible recombinases of the tyrosine recombinase 
family such as Cre and Flp for genome engineering 
reviewed in [1]. In particular the unidirectional activity 
has made them valuable for the promotion of DNA in- 
tegration by recombinase-mediated cassette exchange 
reactions and for the development of iterative recom- 
bination approaches [4-7]. To date five serine integrases 
derived from phages cpC31 [8], (pBTl [9], Bxbl [10,11] and 
R4 [11,12] have been shown to be capable of promoting 
site-specific integration of DNA into mammalian genomes 
while TP901-1 [13], A118, FC1 and cpRV [14] have been 
shown to promote site-specific recombination in an 
extra-chromosomal environment in mammalian cells. 
With one exception these studies have, however, been 
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carried out largely independently of one another, in 
different cell lines, cells of different species and using 
different protocols. The exceptional study was that of 
Yamaguchi and colleagues [11] who compared the ac- 
tivities of the cpC31, Bxbl, TP901-1 and R4 integrases 
in mediating site-specific recombination into a hu- 
man artificial chromosome (HAC) isolated in hamster 
cells. This study exploited a promoter trap strategy 
and thus relied upon selection to assay recombination 
products. Importantly, however, the products were not an- 
alyzed at the level of DNA sequence. It was therefore nei- 
ther possible to determine the total level of recombination 
promoted by these different enzymes nor to determine the 
fraction of recombination events that had proceeded by 
reciprocal and conservative site-specific recombination. 

The discovery that site-specific recombination mediated 
by the (pC31 integrase is sometimes accompanied by DNA 
damage in vertebrate cells identified [15] posed the ques- 
tion as how far integrase associated DNA damage limits 
the use of the serine integrases as genome engineering re- 
agents. Damage of the type seen with the tpC31 integrase, 
has not been detected with the tyrosine recombinases, is 
likely to be a consequence of the DNA cleavage and strand 
exchange mechanism of the serine recombinases (reviewed 
in [16]). The recombination pathway begins with binding 
of integrase to the attachment sites, which are then brought 
together by protein:protein interactions to form a synaptic 
tetramer. The reaction then proceeds by the formation of 
concerted double strand breaks in both of the DNA 
substrates prior to subunit rotation and recombination. 
It seems likely that the damage that accompanies the 
activity of these recombinases in vertebrate cells arises 
as a consequence of these double strand breaks being 
recognized by the mammalian cell double strand break 
repair pathways. It should be noted that to date no 
damage has ever been observed to accompany the ac- 
tion of the serine integrases in bacteria and it may 
therefore be the eukaryotic chromatin environment or 
the nature of the mammalian repair pathways that 
leads to the damage seen in mammalian cells. The fre- 
quency and extent of the damage would seem likely to 
reflect the time spent by the integrases in the cova- 
lently linked, cleaved DNA complex as this is most 
likely to be the target for the repair pathways. 

In total there are fifteen phage-encoded serine integrases 
for which both of their attachment sites are known. Nine 
of these fifteen integrases have been characterised in reac- 
tions in mammalian cells, E. coli or in vitro (cpC31 [17], 
Bxbl [18], cpBTl [19], cpCl [14,20], MRU [21], TP901-1 
[22], R4 [12], A118 [14], and (pRV [14], [23]) while six 
(TGI, cp370.1 [24], WP [25], BL3, SPBc and K38) have not 
yet been shown to be active outside their native hosts. In 
total there are ten integrases whose utility as tools for in- 
tegrating DNA into mammalian genomes has not been 



investigated. We have therefore set out to rank the ac- 
tivities of all fifteen of these serine integrases for which 
the sites are known by the criteria of both accuracy 
and efficiency in two different mammalian cell lines; 
human HT1080 cells and mouse ES cells. These stud- 
ies have provided us with a clear rank order for the 
utility of this important class of enzymes as tools for 
vertebrate genome engineering with Bxbl integrase 
mediating the most efficient and accurate site-specific 
recombination in this heterologous environment. The 
cpC31 integrase comes a close second. For the remaining 
integrases, we demonstrate that DNA damage is an im- 
portant factor in limiting the utility of members of this 
class of enzymes for promoting integration reactions in 
mammalian cells. 

Results and discussion 

Fifteen unidirectional 'phage integrases are active in E. coli 

Fifteen unidirectional phage integrases have been de- 
scribed for which the attachment sites are known. We 
first set out to confirm that each integrase and their cog- 
nate attachment sites were active by expression of each 
integrase gene in E. coli in the presence of a reporter 
plasmid. Fifteen reporter plasmids (conferring chloram- 
phenicol resistance) were constructed in which the 
lacZa gene was flanked by attB and attP sites for each 
of the respective integrases (Figure 1A). These plasmids 
conferred exclusively blue colony when transformed into 
an E. coli strain containing the lacZAMIS mutation and 
plated on selective agar in the presence of X-gal and IPTG 
(Figure IB, top row). We next introduced genes encoding 
each one of the integrases tagged at the N-terminal end 
by a StrepII tag and at the C-terminal end by a SV40 large 
T antigen nuclear localization signal into the E. coli ex- 
pression plasmid pET21a (conferring ampicillin resist- 
ance). Initially we introduced the integrase expression 
plasmids into DH5a E. coli K12 strains containing a cog- 
nate reporter plasmid and scored the transformants as 
white or light blue, indicative of recombination, or blue, 
indicative of no recombination. Active integrases were 
from phages <pC31, Bxbl, TGI, TP901-1, A118, SPBc, W|3, 
cpBTl and cp370.1 in this assay (Figure IB, middle row). 
The transformants containing integrase genes that only 
gave rise to blue or light blue colonies were picked and 
restreaked to single colonies and white colonies were ob- 
served from strains expressing BL3, FC1 and K38 
integrases whereas no white segregants were observed 
from strains containing MRU, (pRV and R4 int genes 
(Figure IB, bottom row). 

As the expression of the integrases might be limiting in 
bacteria containing the MRU, cpRV and R4 integrase 
genes, we repeated the recombination assay in E. coli 
BL21(DE3), a strain that should over-express the integrase 
genes on induction with IPTG. Competent BL21(DE3) 
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Figure 1 Assaying integrase activity in E.coli. A. The reporter plasmid used to assay activity of the integrases in £ coli. This plasmid, derived 
from pACYC184, contains lacZa gene encoding the LacZa peptide flanked by integrase attachment sites. The intact reporter plasmid confers 
B-galactisidase activity on a strain containing the AlacZM15 allele and therefore the colonies appear blue on agar containing presence of X- 
Gal and IPTG. Active integrase promotes site-specific recombination between the attP and off 6 sites resulting in deletion of the lacZa gene 
and the colonies appear white. B. The appearance of E.coli containing the reporter plasmid with or without an integrase expression plasmid 
in the presence of X-Gal and IPTG. 



containing the different reporter plasmids were pre- 
pared and the integrase containing expression plas- 
mids introduced by transformation with selection for 
ampicillin and chloramphenicol resistance. Expression 
of integrase was induced by addition of IPTG to loga- 
rithmically growing cells and cultures were further incu- 
bated overnight at 20°C. Plasmid DNA was extracted from 
1 ml of each culture and used to transform plasmid free 
DH5ct scoring for white and blue colonies. All of the plas- 
mids extracted from the E. coli BL21(DE3) cells expressing 
the integrases gave exclusively white colonies with the ex- 
ception of strains that had contained MRU or tpRV 
integrases, which yielded 50% and 25% unrecombined 
plasmid, respectively (Table 1). The control BL21(DE3) 
strains that contained the reporter plasmids and the 
empty expression plasmid (pET21a) remained stable 
with no loss of the lacZa gene. Accurate site-specific 
recombination for all of the integrases was confirmed 
by PCR and sequencing of the atth sites recovered from 
plasmids present in the white colonies. Cells from the 
same cultures were used to assay the level of expression of 
the StrepII-tagged integrases by western blots (Table 1). A 
StrepII-tagged integrase of the expected molecular weight 
was detected from all the BL21 (DE3) strains expressing 
an integrase except those containing the W|3, Bxbl, BL3 
or R4 integrase genes. The amount of protein present was 



Table 1 In vivo recombination activity after 
overexpression of integrases in E. coli BL21(DE3) 



Integrase ug tagged protein/ml 1 % recombination 2 



Bxb1 


ND 


100 


wp 


ND 


100 


BL3 


ND 


100 


cpR4 


ND 


100 


Al 1 8 


0.6 


100 


TG1 


2.7 


100 


MRU 


2.9 


50 


cp370 


4.1 


100 


SPBc 


9.0 


100 


TP901-1 


32.3 


100 


cpRV 


55.9 


75 


FC1 


74.7 


100 


K38 


319.6 


100 


cpBTI 


350.4 


100 


cpC31 


758.2 


100 



The amount of Strepll tagged integrase detected in a Western blot from 1 ml 
of culture. 

2 The % of plasmid extracted from the cultures that had 
undergone recombination. 
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determined by comparing the intensities of the bands 
from the western blotting with those from a series of stan- 
dards (Table 1). While there was no simple relationship 
between the amounts of protein detected in the cultures 
and the level of recombination, this experiment demon- 
strated that all the integrases and their attachment sites 
were active in E. coli. 

Assaying the unidirectional phage integrases in 
vertebrate cells 

The activities of the integrases were then measured in 
vertebrate cells. In order to do this we constructed three 
reporter plasmids that could be used to assay the ability 
of all of the different integrases to mediate either dele- 
tion or integration in mammalian cells. Each of the re- 
porter plasmids included arrays containing attP or attB 
sites for all 15 integrases arranged head-to-tail. For the 
deletion assay we used a plasmid, called 'attP array CCAG 
HyTK attB array' (Figure 2A), in which a counter select- 
able marker gene, a fusion between a gene conferring 



resistance to hygromycin and a herpes simplex virus thy- 
midine kinase gene that confers sensitivity to the nucleo- 
side analogue gancyclovir, HyTK, was placed between 
arrays of attB and attP sites in a head-to-tail orientation 
(Figures 2B and 2C). If integrase mediated recombination 
occurs between its cognate attP and attB sites flanking 
the HyTK gene, the cells become resistant to gancyclovir. 
The assay for integration activity was based upon the use 
of two plasmids. The first of these, called 'attP array 
CCAG HyTK attP array! contained the docking attP sites, 
flanking the counter selectable marker (Figure 2A) and 
the second, called 'attB array CCAG neo attB array! 
contained the incoming attB sites flanking a gene encod- 
ing resistance to the antibiotic G418 (Figure 2D). Cassette 
exchange between the incoming attB sites and the docking 
attP sites is expected to yield chromosomes containing attR 
and attL flanking the integrated neo gene and the cells dis- 
play both gangcylovir and G418 resistance. 

In both assays the integrases were expressed using 
plasmids (Figure 2E) in which each integrase gene had 
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Figure 2 Assay system for integrases in mammalian cells. The reporter plasmid design (A) used to assay either site-specific deletion or 
integration promoted by serine integrases in vertebrate cells. In the deletion reporter construct, called attP array CCAG HyTK atrB array, the 
counter selectable gene CCAG HyTK was placed between an array of attB sites (B) and an array of attP sites (C). In the integration or 
recombinase-mediated cassette exchange constructs, the docking construct, attP array CCAG HyTK attP array, had the CCAG HyTK gene flanked 
by two arrays of attP sites and the reporter construct, termed attB array CCAG neo attB array (D) contained the CCAG Neo gene conferring 
resistance to G418 flanked by arrays of attB sites. The integrase expression constructs are shown schematically in (E) each containing an int gene 
modified at the 5' and 3' ends to encode a Strepll tag and a nuclear localization signal, respectively, and placed down-stream of a CCAG promoter 
and upstream of an internal ribosome entry site and a dominant selectable marker conferring either zeocin or xanthine resistance (ecogpt). 
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been codon optimized and tags placed at the 5' and 3' 
ends encoding the StrepII tag and the nuclear localization 
signal. An internal ribosome entry site (IRES) followed by 
an antibiotic resistance gene was placed downstream of 
each integrase gene. 

The use of arrays of attachment sites in the reporter 
plasmids had three merits: Firstly it enabled a strategy 
that ensured that comparisons between the different 
integrases were not compromised by position effects 
arising as a result of the target sites for the different 
integrases being integrated at different positions in the 



vertebrate genome. Secondly it allowed us to determine 
whether there was any cross-reactivity between the dif- 
ferent integrases and their attachment sites. Finally this 
approach proved to be efficient, requiring only a small 
number of cell lines to be used for all of the necessary 
assays. However as a precaution, we wanted to exclude 
the possibility that placing the attachment sites within 
an array altered their activity and so we compared the 
in vitro activity of one of the integrases, cpC31 integrase, 
using substrates in which the attachment sites were ei- 
ther isolated or present in the arrays (Figure 3A-E). 



A pRT702 (attP) x 

pRT600 (attB) 
[Int] 0 



B pRT700 (attP) x 
pUC57_attB array 

[Int] 0 



C pUC57_attP array x 

pRT600 (attB) 
[Int] o 




M 1 2 3 4 

D pCCAG_attP array x 
pUC57_attB array 

[Int] o 




M 1 2 3 
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4 attR 
attB 
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integrase [nM] 
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Figure 3 Comparing the activity of cpC31 integrase on isolated attachment sites or attachment sites located in the multi-site arrays. 
A-E. Assays cpC31 integrase activity on attachment sites in different configurations. The indicated plasmids containing either attB or attP sites 
were incubated in the absence or in the presence of 0 nM (lane 1), 100 nM (lane 2), 200 nM (lane 3), 350 nM (lane 4) and 700 nM (lane 5) cpC31 
integrase. Recombination reactions were digested with either Hindlll or BamHI and the products size fractionated by agarose gel electrophoresis. 
F. The data shown in A-E were analyzed densitometrically and the fraction of the substrate converted into product calculated as described in the 
materials and methods. This comparative analysis was carried out in two independent experiments. 
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These experiments showed that the recombination effi- 
ciencies of the attachment sites were similar whether 
they were isolated or present in the arrays. 

Identification of eight unidirectional phage integrases 
promoting site-specific deletion in mammalian cells 

Firstly we wanted to determine which of the fifteen 
integrases were active on substrates integrated into verte- 
brate genomes. The most sensitive way in which to detect 
activity is to assay recombination between attB and an 
attP sites in cis because the proximity of the sites favours 
the kinetics of synapsis, the process by which integrase 
bring the substrates together in a tetramer prior to DNA 
cleavage. We therefore transfected the construct designed 
to assay deletion, attP array CCAG HyTK attB array, into 
human HT1080 cells by electroporation and selected for 
stable transfectants using hygromycin. We screened 96 
stably transfected clones for the integrity of the integrated 
DNA using PCR, checked for single copy integrants by re- 
striction enzyme analysis and filter hybridization and then 
confirmed the integrity of the attP and attB arrays by se- 
quencing. In this way we recovered four independent, sta- 
bly transfected clones from two independent transfections, 
each containing a single copy of the integrated deletion re- 
porter construct, attP array CCAG HyTK attB array. 

In order to compare the activities of the different 
integrases we first transiently transfected 10 5 HT1080 
cells containing the attP array CCAG HyTK attB array 
reporter with expression plasmids for each of the integrases 
using Lipofectamine and assayed for recombination activity 
by selecting for resistance to gancyclovir, a drug which is 
selectively toxic for cells expressing the HyTK fusion. None 
of the integrases gave a significant increase in the number 
of gancyclovir-resistant cells as compared to the empty ex- 
pression vector (Adiitional file 1: Table S3) and so we 
assayed pools of the resistant cells for the presence of the 
recombinant attR site using PCR. Recombination activity 
was detected in populations of cells transfected with the 
R4, (pC31, cpBTl, Bxbl, SPBC and Wp integrase expression 
constructs (not shown). However the low level of activity 
overall made it impossible to conclude anything about the 
integrases that did not yield attR in the PCR reactions as 
the integrases may be active but causing damage that re- 
moved the attR primer binding sites, may simply be 
slow in promoting site-specific recombination or com- 
pletely inactive. Moreover the variability of the relative 
numbers of gancyclovir resistant clones generated in dif- 
ferent experiments made an accurate comparison between 
the active integrases impractical. 

We therefore used electroporation to transfect linear- 
ized integrase expression constructs into each of the two 
independent cell lines containing the attP array CCAG 
HyTK attB array reporter used in the transient expres- 
sion experiments and selected for clones containing stably 



integrated, integrase expression constructs (Additional file 1: 
Table S4). The yield of clones generated by the integrase ex- 
pression constructs was at least an order of magnitude lower 
than those recovered following transfection with the empty 
expression vector suggesting that all of the integrases were 
toxic to various degrees. The TGI and (p370.1 integrases 
appeared particularly toxic by this criterion because these 
consistendy gave us no stably transfected clones. We divided 
the clones that had been stably transfected with each of the 
integrases into two groups of approximately equal sizes. We 
used the first of these groups to estimate the ability of the 
integrases to promote site-specific recombination of the attP 
array CCAG HyTK attB array reporter construct within the 
first two weeks of exposure to integrase. In order to do this 
we applied gancyclovir selection to these clones and then 
assayed individual gancyclovir resistant clones for one of the 
recombinant products, attR, by PCR. The six integrases that 
had shown clearly detectable activity in the transient trans- 
fection experiment (R4, cpC31, cpBTl, Bxbl, SPBc and Wp) 
showed activity after stable expression of the integrase 
(Figure 4A). The remaining integrases showed no evi- 
dence for site-specific recombinase activity following 
gancyclovir selection. We wanted to determine how 
accurately the six active integrases were mediating 
site-specific recombination. We therefore sequenced at 
least 7 PCR products containing recombinant attR 
sites derived from each of the six active integrases and 
showed that, with the notable exception of the R4 
integrase, all exclusively yielded products that were consist- 
ent with conservative site-specific recombination (Table 2). 
In the case of the R4 integrase only 2/9 PCR products 
contained intact attR sites, demonstrating that this enzyme 
seems particularly damage prone, at least in human cells. 

We wanted to know whether the complete failure to 
detect deletion activity after two weeks of growth in the 
presence TP901-1, FC1, cp370.1, K38, cpRV, A118, BL3 
and MRU integrases was because these recombinases 
were slow or because they were damaging the target 
sites. We therefore applied hygromycin selection to the 
second group of clones that had been stably transfected 
with the integrase expression construct but not exposed to 
gancyclovir, then relaxed hygromycin selection and 
analysed the clones for any detectable recombination after 
a further two weeks by PCR. The results were clear; 
TP901-1 integrase showed detectable recombination after 
further culture indicating that it was indeed slow but the 
remaining integrases showed no evidence of recombination 
(Additional file 1: Table S4). As before we determined the 
sequence of the PCR products containing the predicted 
attR site generated by the TP901-1 integrase; 4 were intact 
attR sites and 1 was damaged thus showing that TP901-1 
integrase is like the R4 integrase and prone to site damage. 
We wanted to know whether the remaining 8 integrases 
that had failed to show productive recombination were 
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Figure 4 Comparing the activity of fifteen serine recombinases in human HT1080 cells and in mouse ES cells. A. HT1080 cells containing 
a single integrated attP array CCAG HyTK attB array reporter construct were stably transfected with an integrase expression plasmid selecting for 
antibiotic resistance. The transfected clones were then subjected to gancyclovir selection to identify those that had lost the CCAG HyTK marker 
gene and then screened for recombinant attR sites to identify those that had undergone integrase-mediated site-specific deletion of CCAG HyTK. 
The two bars shown for each integrase reflect the results of two independent experiments. (Additional file 1: Table S4). None of the other eight 
integrases yielded recombinant gancyclovir resistant clones and they are not shown in this part of the figure. B; ES cells containing a attP array 
CCAG HyTK arfS array reporter integrated at the ROSA26 locus were transiently co-transfected with an integrase expression plasmid and a 
linearized PGKneo construct to normalize for differences in the efficiency of transfection. Gancyclovir resistant clones were screened for attR sites. 
Two experiments were carried out and the bars reflect the means of the normalized activities (Additional file 1: Table S5). The analyses were 
carried out twice on a single attP array CCAG HyTK attB array reporter cell line and thus the data is shown as the mean of the two experiments. 
C; HT1080 cells that had been transfected with the empty expression vector (CCAG iresZeo) or with the indicated integrase expression vector 
were transiently transfected with the attP array CCAG HyTK attB array reporter and scored for site-specific recombination by PCR. For cp370 and 
TG1 integrases the integrase expression vector was co-transfected with the reporter construct, the remaining experiments were carried out with 
HT1080 cell lines that had been stably transfected with the respective integrase expression construct. 



attempting recombination but were in fact damaging 
the attachment sites. We therefore sequenced their 
substrate attachment sites {attP and attB) in the inte- 
grated reporter plasmids in two of the hygromycin resist- 
ant clones that had been transfected with the respective 



integrase expression constructs. None showed evidence of 
site damage. 

Thus we concluded that the R4, cpC31, cpBTl, Bxbl, 
SPBc, TP901-1 and W(3 integrases are active on sub- 
strates integrated in to the genome of HT1080 cells 
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Table 2 Accuracy of integrase mediated site-specific recombination in human HT1080 and mouse ES cells 



Integrase Sequence Integrations Deletions 

BxB1 HT1080 ES 

attB TCGGCCGGCTTGTCGACGACGGCGGTCTCCGTCGTCAGGATCATCCGGGC 

attP GTGGTGGTTTGTCTGGTCAACCACCGCGGTCTCAGTGGTGTACGGTACAAACGCCGAC 

attL TCGGCGGGGTGTCGACGAGGGCGGTCTCAGTGGTGTACGGTACAAACGCCGAC 3/3 

attR GTCGTGGTTTGTCTGGTGAACCACCGGGGTGTGCGTCGTCAGGATCATGCGGGC 7/7 2/2 

fC31 

attB TGGGGGTGGCAGGGGGTGCCCTTGGGCTGCCCGGGCGCGTACTCC 

attP GTGCCCCAAGTGGGGTAACCTTTGAGTTCTCTCAG1TGGGGG 

attL TGCGGGTGCGAGGGCGTGCGCTTGAGTTCTCTCAGTTGGGGG 3/3 

attR TTGCCCGAACTGGGGTAACCrrTGGGCTGCCCGGGCGCGTAGTGC 7/7 3/3 

R4 

attB GCGCCCAAGTTGCCCATGACCATGCCGAAGCAGTGGTAGAAGGGCACCGGCAGACAC 

attP AGGCATG1TGCCCAAAGCGATACCACTTGAAGCAGTGGTACTGCTTGTGGGTACACTCTGC 

attL GGGCGCAAGTTGCGCATGACCATGCGGAAGCAGTGGTACTGCTTGTGGGTACACTCTGC 3/3 

attR AGGCATGTTCCCGAAAGCGATACGACTTGAAGGAGTGGTAGAAGGGCACCGGCAGACAC 2/9 1/2 

del clone 1 DattR AGGCATGTTCCGCAAAGGGATACGAC AAGCAGTGGTAGAAGGGCACCGGGAGACAC 

del clone 2 DattR AGGGATGITCGCCAAAGCGATACCACTT AAGCAGTGGTAGAAGGGCACCGGGAGACAC 

del clone 3 DattR AGGCATGTTCCCCAAAGCGATACCACTTG GCAGTGGTAGAAGGGCACCGGCAGACAC 

del clone 4 DattR AGGCATGTTCCCCAAAGCGATA GCAGTGGTAGAAGGGCACCGGCAGACAC 

del clone 5 DattR AGGCATGTTCCCCAAAGCGATACCA GTGGTAGAAGGGCACCGGCAGACAC 

del clone 6 DattR AGGCATGTTCCCCAAAGCGATACCA GAAGCAGTGGTAGAAGGGCACCGGCAGACAC 

del clone 7 DattR AGGCATGTTCCCCAAAGCGATACCAC AAGCAGTGGTAGAAGGGCACCGGGAGACAC 

ES del clone 1 DattR AGGCATGTTCCCCAAAGCGATACCA AGCAGTGGTAGAAGGGCACCGGCAGACAC 1/2 

intn clone 1-3 DattR AGGCATGTTCCCCAAAGCGATA CAGTGGTAGAAGGGCACCGGCAGACAC 3/3 
fBTI 

attB GTCCTTGACCAGGTTTTTGACGAAAGTGATCCAGATGATCCAGCTCCACACCCCGAACGC 

attP GGTGCTGGGTTGTTGTCTCTGGACAGTGATCCATGGGAAACTACTCAGCACCACCAATG 

attL GTCCTTGACCAGGTTTTTGACGAAAGTGATCCATGGGAAACTACTCAGCACCACCAATG 3/3 

attR GGTGCTGGGTTGTTGTCTCTGGACAGTGATCCAGATGATCCAGCTCCACACCCCGAACGC 7/7 2/2 

intn clone 1 DattR GGTGCTGGGTTGTTGTCTC GATCCAGATGATCCAGCTCCACACCCCGAACGC 1/3 
WB 

attB AAGGTAGCGTCAACGATAGGTGTAACTGTCGTGTTTGTAACGGTACTTCCAACAGCTGGCG 

attP TAGTTTTAAAGTTGGTTATTAGTTACTGTGATATTTATCACGGTACCCAATAACCAATGAA 

attR TAGTTTTAAAGTTGGTTATTAGTTACTGTGATATTTGTAACGGTACTTCCAACAGCTGGCG 7/7 2/2 

SPBc 

attB AGTGCAGCATGTCATTAATATCAGTACAGATAAAGCTGTATCTCCTGTGAACACAATGGGTG 7/7 2/4 

attP AAAGTAGTAAGTATCTTAAAAAACAGATAAAGCTGTATATTAAGATACTTACTAC 

attR AAAGTAGTAAGTATCTTAAAAAACAGATAAAGCTGTATCTCCTGTGAACACAATGGGTG 

ES del clone 1 DattR AAAGTAGTAAGTATCTTAAAAAACA AGCTGTATCTCCTGTGAACACAATGGGTG 1/4 

ES del clone 2 DattR aaQafocagcgfffffcafgtQcaactafacf GCTGTATCTCCTGTGAACACAATGGGTG 1/4 
TP901 

attB TGATAATTGCCAACACAATTAACATCTCAATCAAGGTAAATGCTTTTTCGTTTT 

attP AATTGCGAGTTTTrATTTCGTTTATTTCAATTAAGGTAACTAAAAAACTCCTTT 
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Table 2 Accuracy of integrase mediated site-specific recombination in human HT1080 and mouse ES cells (Continued) 

attR AATTGCGAGTTITTATTTCGTTTATTTCAAGGTAAATGCTnTTCGTnT 4/5 

del clone 1 DattR AATrGCGAGTTTTTATTTCGnTATTTC GGTAAATGCrmTCGTITT 1/5 

The results show the sequences of the recombinant sites detected following integrase mediate site specific recombination in human HT1080 or mouse ES cells. 
Underlined and in lowercase in the case of one sequence from a deletion event occurring in ES cells following expression of the SPBc integrase is the sequence of 
a flanking (p370.1 attP site discussed in the text. 



although the TP901-1 integrase promotes recombination 
slowly and that the FC1, cpK38, RV, A118, BL3 and 
MRU integrases are not detectably active on integrated 
substrates. We cannot make any statement about the 
TGI and cp370.1 integrases because we were unable to re- 
cover clones expressing these integrases for experimental 
analysis. We attempted to use western blotting to assay 
for expression of the integrases using with an antibody to 
the N-terminal StrepII tag but we were unable to detect 
any signal with any of the integrases suggesting that they 
are all expressed at low levels. Rank ordering of the dele- 
tion activities of the integrases in this experiment indi- 
cated the following Bxbl = q>C31 = cpBTl > R4 = W(3 > 
SPBc > TP901-1. The utility of the R4 integrase would 
seem to be limited by its liability to site damage. The pur- 
pose of this part of our project was to identify those en- 
zymes that were active in vertebrate cells and we did not 
investigate the background of GANC r attR clones. Al- 
though such clones were seen in the cells transfected with 
the empty vector and may arise from background silen- 
cing of the attP CCAG HyTK attB indicator gene or from 
loss of the chromosome carrying this gene, they occur at a 
higher level in the clones that had been transfected 
with integrases (Additional file 1: Table S4). We cannot 
exclude the possibility that they arise as a result of 
recombinase-mediated target site damage, although the 
accurate recombination activities seen with five of the 
seven active integrases would suggest that this is unlikely. 
The source of these background clones is therefore 
unclear. 

It is also clear that not all of the clones that were suc- 
cessfully transfected with a construct expressing an ac- 
tive integrase yielded recombinant products. Thus even 
with the Bxbl and (pC31 integrases about 30% of the 
clones that were resistant to the antibiotic used to select 
for the presence of the expression construct failed to 
yield clones that were resistant to gancyclovir and contained 
an attR site. One hypothesis was that these clones failed to 
express sufficient integrase to bind and synapse the sub- 
strates, processes that are dependent on the affinity of 
integrase for its attP and attB sites and the expression level 
of integrase. We tried to test this idea by using the StrepII 
epitope with which we had tagged all of the integrases but, 
as before, were unable to do so for all of the integrases be- 
cause the StrepII epitope tag was insufficiently sensitive and 
no signal was obtained in the western blots. However we 
had specific polyclonal antibodies for the (pC31 and cpBTl 



integrases that we expected would be more sensitive and in- 
deed they allowed us to measure the presence of the re- 
spective integrases by western blotting,. The results of this 
analysis was consistent with the notion that at least for some 
integrases the failure of site-specific recombination 
was associated with inadequate or low levels of expres- 
sion (Additional file 1: Figure S2). The success of the 
western blotting using the polyclonal antibodies also 
demonstrates that the previous failure to detect integrase 
expression in the human cells using the StrepII tag and 
antibody was due to the relatively low sensitivity of this 
system. 

The results obtained with the human HT1080 cells posed 
the question of whether they were generally true for verte- 
brate cells. Genome engineering of mouse embryonic stem 
cells (ES cells) is widely practiced and so we used a deletion 
strategy to assay the activities of the integrases in mouse ES 
cells. We used sequence targeting to introduce the attP 
array CCAG HyTK attB array deletion reporter cassette 
into the ROSA26 locus (Additional file 1: Figure SI) and 
then assayed for deletion by gancyclovir resistance fol- 
lowing transient transfection with the set of integrase 
expression constructs described above. In order to avoid 
problems with toxicity that would compromise the prac- 
tical significance of any results we chose to use transient 
assays and an internal control to carry out the experiment. 
In this ES cell system there appeared to be a clearer differ- 
ence between the numbers of gancyclovir-resistant clones 
seen with the empty vector and those with expressing 
integrase than was observed in the HT1080 cells. The dif- 
ference was not absolute however and identification of ac- 
tive integrases also required PCR analysis for the presence 
of a recombinant attR site. The results (Additional file 1: 
Table S5 and Figure 4B) were similar but not identical to 
those seen in the HT1080 cells; R4, cpC31, cpBTl, Bxbl, 
SPBc, W|3 and TGI integrases showed detectable activity 
but the TP901-1 integrase did not. The activity seen with 
the TGI integrase was, however, weaker than seen with 
the others with only one out of eight clones containing a 
detectable attR site. Rank ordering of the deletion activities 
of the integrases in this experiment indicated the following 
W(3 > Bxbl >cpC31 > SPBo R4 > cpBTl. We analyzed the 
accuracy of the site-specific recombination mediating the 
deletion reaction in two gancyclovir-resistant clones 
generated by the Bxbl, R4, W|3 and SPBc integrases. 
These results demonstrated that the Bxb 1, cpBTl and 
W|3 integrases all mediated site-specific recombination 
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accurately but that the R4 integrases was associated with a 
deletion in one of the two recombination products analyzed 
and that both of the SPBc products were deleted. In the case 
of the SPBc integrase the region deleted 171 bp in the attP 
array extending as far as the cp370.1 attP site raising the pos- 
sibility of a lack of specificity in attP site recognition by the 
integrase. This however seems unlikely because the break- 
point in the (p370.1 attP site is not at the proposed recom- 
bination junction but 6 bp 3' of the point of symmetry 
defining the pseudo-palindrome of this attachment site. 

The failure to detect recombinase activity mediated by 
the (p370, FC1, TGI, RV, FC1, cpK38, MRU, A118 and 
BL3 integrases in HT1080 cells using deletion substrates 
integrated into the vertebrate genome posed the question 
as to whether these integrases were active in HT1080 cells 
or whether they were simply prevented from acting by the 
fact that the target sites were integrated into the genome. 
We therefore carried out a series of transient tranfection 
experiments in which the deletion substrate plasmid was 
transiently transfected into HT1080 cells that had been 
stably transfected with an expression plasmid for one of 
these integrases or, in the case of the q>370 and TGI 
integrases, (where such stably transfected cell lines did not 
exist) co-transfected the respective expression plasmid 
and deletion substrate plasmid and then analyzed the 
extracted DNA for deleted plasmid after 72 hours by PCR. 
The recombinants were assayed using one of two PCR re- 
actions for which the Bxb 1 or W[3 integrases acted as 
positive controls. The results (Figure 4C) demonstrate that 
the RV, cpK38, MRU, A118 and BL3 integrases were in 
fact active in the HT1080 cells and thus we conclude that 
these integrases are unable to promote site-specific re- 
combination when their substrates are integrated into the 
genome of the HT1080 cells but can do so when they are 
present extra-chromosomally. We can make no statement 
about the cp370, FC1, TGI integrases as we have no evidence 
to determine whether they are expressed. 

Comparative integration activities of seven unidirectional 
phage integrases in mammalian cells assayed by 
recombinase mediated cassette exchange 

Site-specific integration, and in particular the exchange 
of marker cassettes, collectively termed recombinase me- 
diated cassette exchange, is an important technique for 
the precise introduction of DNA into cells, in particular 
for the comparative analysis of different genes integrated 
at the same docking site. The uni-directional serine 
integrases are ideally suited to this application as they 
require only simple attachment sites and no host accessory 
proteins. We wanted to determine which of the seven 
integrases that we had identified above as being active in 
promoting site-specific deletion in human cells, also func- 
tioned in promoting site-specific cassette exchange. Our 
previous experiments had shown the merits of using cell 



lines that contained arrays of attachment sites that were 
then stably transfected with different integrase-expressing 
plasmids as reagents for accurate comparisons of integrase 
activity. We first introduced an integration target' attP 
array CCAG HyTK attP array expressing the hygromycin- 
thymidine kinase fusion flanked by arrays of attP sites into 
human HT1080 cells by electroporation and, as before, se- 
lected structurally intact, single copy integrants. We then 
transfected these cells with individual integrase expression 
plasmids for each of the seven integrases that had been 
shown to be active in the deletion assay and again selected 
for stable integrants. In order to compare the cassette ex- 
change activities of the different integrases we then transi- 
ently transfected two completely independent clones with 
the integration substrate, attB array CCAG Neo attB array 
(Figure 2). The work described above indicated that the 
integrases were toxic to varying degrees (Additional file 1: 
Table S4) and so we expected that there would be different 
numbers of resistant clones derived from the transfection 
experiments of clones expressing different integrases, des- 
pite similar amounts of plasmid being used in the transfec- 
tion. We controlled for such differences by transfecting the 
integrase-expressing clones with a linearized CCAG Neo 
plasmid in a parallel experiment carried out at the same 
time as they were transfected with the circular attB array 
CCAG Neo attB array plasmid We selected for G418 re- 
sistance in both cases and normalized the experimental 
transfections using the yield of clones generated following 
transfection with the linearized CCAG Neo. We then 
assayed for cassette exchange in the experimental clones 
by selecting the G418 resistant clones for gancyclovir re- 
sistance to identify those clones that had lost the CCAG 
HyTK marker and by PCR in order to identify the attL and 
attR products of a site-specific recombination reaction. We 
carried out three such assays on two independent attP 
CCAG HyTK attP, integrase-expressing clones. The results 
(Additional file 1: Table S6, Figure 5A) of these experi- 
ments revealed consistent and significant differences be- 
tween the seven integrases in terms of their abilities to 
mediate site-specific integration. At one extreme were the 
Bxbl and (pC31 integrases which promoted efficient 
and accurate site specific integration and at the other 
were the three integrases W(3, SPBc and TP901-1 
which generated clones that had the phenotypes 
expected of cassette exchange through site-specific re- 
combination i.e. they were gancyclocvir -resistant and 
G418-resistant, but for which we could not obtain PCR 
products for attL or attR. We sequenced the attL and 
attR sites generated by site-specific integration in the 
Bxbl, (pC31, R4 and tpBTl integrase-expressing clones. 
The Bxbl and <pC31 integrases products were as predicted 
but in three cases the attR product generated by the R4 
integrase was damaged and similarly one of the three attR 
sites generated by the cpBTl integrase was also damaged. 
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Figure 5 Comparing the activity of seven different serine recombinases in human HT1080 cells for their utility in recombinase 
mediated cassette exchange. A. Cell lines containing a single integrated attP array CCAG HyTK attP array reporter construct and stably 
expressing the indicated integrase were transfected with the attB array CCAG neo attB array integration reporter construct using lipofectamine. 
The experiment was carried out three times using two independent cell lines for each of the seven integrases. The number of G418-resistant 
clones generated by the cassette exchange reaction promoted by the different integrases was normalized by transfection using lipofectamine 
with a uniform amount of linearized CCAG neo plasmid. Open bars correspond to the number of colonies generated with the reporter plasmid 
divided by the number of colonies generated with the linearized CCAG neo (for the raw data see Additional file 1: Table S6). Between seven and 
ten colonies of each transfection were picked and assayed for site-specific recombination by PCR and the total yield of colonies generated by 
site-specific recombination is represented by the filled bars. B. Cell lines that had been stably transfected with the indicated integrase expression 
construct were transiently transfected with the indicated integration reporter construct and after three days assayed for site specific 
recombination by PCR. Two different reactions were used for the assay; one for the Bxb1 integrase and the other for the remaining integrases. 
The gel on the final panel shows the PCR reaction products obtained when the indicated reporter constructs were transfected into HT1080 cells 
that expressed no integrase and assayed for site-specific recombination by either of the two reactions. 




The experimental results shown in Additional file 1: Table S6 
could be interpreted to suggest that there is a high back- 
ground of non-specific integration of the circular plas- 
mids into the genome of the HT1080 cells in the absence 
of any integrase. Such an interpretation would only be 
valid however if it the cells containing the empty expres- 
sion vector were transfected with equal efficiency to 
those expressing any of the integrases. Given the toxicity 
of the integrases seen in the results shown in Additional 
file 1: Table S4 this is almost certainly not the case and 
thus the relatively apparently high background of non- 
specific integration simply reflects the fact that the cells that 
do not express integrase are just more easily transfected 
than those that do. 



We wanted to understand how the gancyclocvir-resistant, 
G418-resistant, attL, attR' clones had been generated by 
the Wp\ SPBC and TP901-1 integrases. First of all we 
wanted to confirm that indeed these integrases were active 
in the human cells and in order to investigate this we 
assayed extra-chromosomal site-specific recombination 
following transient transfection. We therefore took one of 
the clones expressing each of these integrases and Bxbl 
integrase as a control and then transiently transfected the 
cells with the integration substrate plasmids either alone 
or together and then confirmed site-specific recombin- 
ation by PCR for each of the respective attR and attL sites 
(Figure 5B). We supposed that the gancyclocvir-resistant, 
G418-resistant, attL, attR' clones generated by these 
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recombinases arose as a result of DNA damage arising 
as a result of abortive recombination. We therefore 
characterized the structure of the remains of the docking 
site (the attP array CCAG HyTK attP array plasmid) and 
the integrated attB array CCAGNeo attB array plasmid in 
10 clones derived by transfection of either the W(3 or 
TP901-1 integrase expressing clones (Figure 6A). This ana- 
lysis showed evidence of target site damage with extensive 
deletion of the HyTK coding region and flanking sequences. 
In the case of the TP901-1 integrase six of the clones (num- 
bers 3, 5, 7, 8, 9 and 10) the internal sequences (2, 3 and 4 
in Figure 6 A) were deleted but flanking sequences (1 and 5 
in Figure 6A) were intact suggesting the possibility that the 
two attP sites had simply ligated together after deletion of 
the intervening DNA. We therefore analysed the residual 



attP sequences in three clones: two, numbers 4 and 9, 
showed evidence of deletion of the residual attP sequences 
(Figure 6B) while in a third, number 8, the residual attP se- 
quence was intact. For both integrases the incoming attB 
array CCAG Neo attB array plasmid also showed evidence 
of DNA damage, but in this case the damage was associated 
with the DNA flanking the cognate attachment sites. Thus 
in eight out of ten of the W|3 derived clones one or other of 
the attachment sites showed evidence of damage and three 
out of the ten TP901-1 derived clones showed evidence of 
damage to one or other of the attachment sites. The con- 
clusion that we draw from these studies is that DNA dam- 
age is limiting the ability of the W|3, SPBc and TP901-1 
integrases to mediate site-specific recombination and cas- 
sette exchange in vertebrate cells and compromises the 
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Figure 6 Analysis of G418-resistant, gancyclovir-resistant, attR', attL' clones generated by the wp and TP901 integrases. A Ten G418- 
resistant, gancyclovir-resistant, attR', attL clones generated by one or other of these two integrases were analysed by PCR across the indicated 
sequences in each of the two reporter plasmids used in the transfection. The numbers below the plasmid maps indicate the PCR reactions 
assayed in the table. Primer sequences used for these assays are listed in the (Additional file 1: Table S7). B. Sequence analysis of damaged sites 
in the three indicated clones. The diagram shows the regions deleted in the remnant of the flanking arrays found in each of three clones. 
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potential utility of the R4 and (pBTl integrases. However 
we are able to rank the utility of the other four en- 
zymes: Bxbl > (pC31 > R4 > (pBTl. 

As before we also tried to assay the ability of these seven 
enzymes to mediate site-specific integration in mouse cells 
using transient expression of the integrase. None were de- 
tectably active suggesting that as in human cells detectable 
integration with this configuration of attachment sites re- 
quires stable expression of the integrase. 

Conclusions 

The major conclusion following from this work is that 
although all of the fifteen unidirectional serine integrases 
for which both attachment sites have been identified are 
active in E.coli, only four of these; Bxbl, cpC31, R4 and 
(pBTl are able to mediate accurate site-specific integra- 
tion into genomic DNA in human cells and rank such 
that Bxbl is marginally better than (pC31 and both are bet- 
ter than R4 and (pBTl integrases. The three integrases 
from the phages W[3, SPBc, TP901-1 are active in verte- 
brate cells as judged by their ability to mediate site- 
specific deletion and by their ability to mediate integration 
reactions extra-chromosomally but fail to complete suc- 
cessful site-specific integration when attempting an inte- 
gration by cassette exchange because they damage one or 



other of the participating DNA molecules. Although there 
are differences (Figure 4) between the activities seen with 
the different integrases in human and mouse cells the data 
the overall pattern of activites in the two cell types is such 
that it would seem prudent to adopt the Bxb 1 integrase 
as the first choice in both. We have not investigated the 
causes of the different activities but they must reflect in- 
teractions between the respective integrases and host 
encoded proteins. 

Our observations have three practical implications and 
pose one question. The practical implications are first 
that the Bxb 1 integrase should be the first choice for 
any genome engineering in vertebrate cells that requires 
the use of a serine integrase. Second, that screening for 
more serine integrases that can be used in vertebrate 
cells is likely to have a low success rate and, third, that it 
is necessary that all site-specific integrants, particularly 
those generated by the R4 and (pBTl integrases should be 
checked for the fidelity of the recombination reaction. 

The results obtained in the integration experiments with 
the W(3, SPBc, TP901-1 integrases pose the question of 
how such damage arises. One possible mechanism is set 
out in Figure 7. In this schema one or other of the attach- 
ment sites on the incoming donor plasmid, attB array 
CCAG neo attB array forms a synaptic complex with an 
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Figure 7 Hypothetical mechanism for the origin of the G418 r gancyclovir r , attfV, attL' clones generated by the serine integrases during 
integration reactions. The figure shows a two step model for the origin of the GANC r Neo r attL' afffl'clones arising during the integration 
experiments. A illustrates the first step in the model; the deletion of the HyTK gene and B illustrates the second step; the integration of the 
CCAGneo gene. Further details are discussed in the text. 
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attP site in the integrated docking cassette located on attP 
CCAG HyTK attP. The attP and attB sites are then 
cleaved but because the two participating sites are embed- 
ded in chromatin, strand exchange and/or re-joining of 
the DNA backbone are inhibited. Possibly the intermedi- 
ate complex, containing the covalent attached integrase 
subunits to the cleaved DNA, is unstable and dissociates, 
or the process of strand exchange by subunit rotation 
is interrupted. The cleaved DNA covalently linked to 
integrase subunits leads to resection of the target attP 
site by DNA repair pathways which in turn leads to 
loss of the counter-selectable HyTK gene (Figure 7A). 
The concomitant double strand break in the attB array 
CCAG neo attB array plasmid enables this plasmid to inte- 
grate efficiently elsewhere in the genome (Figure 7B). How- 
ever not all of the gancyclovir-resistant, G418-resistant, 
attU, attR clones recovered following transfection of the 
W|3 or TP901-1 integrase expressing lines showed evidence 
of damage at one or other of the donor attB sites and one 
assumes that these clones arise by a similar but more com- 
plicated mechanism in which the transfected cell takes up 
more than one of the these plasmids and that it is one of 
the plasmids that have not participated in the abortive at- 
tempt at site specific integration that integrates into the 
host cell genome. 

If chromatin is inhibiting strand exchange by the serine 
integrases, one might also have reasonably expected chro- 
matin to alter other activities of integrase such as site- 
selection. In vitro cpC31 integrase only recombines attP x 
attB and is never active on other pairs of attachment sites 
including attP with attP or attB with attB. This site- 
selectivity is explained as only integrase dimers bound to 
attP and to attB can tetramerise to form the synaptic 
complex. Within this complex activation of DNA cleavage 
occurs. In all of our experiments we did not observed al- 
tered site-selectivity or DNA damage arising from some 
attempt at recombination of two attP sites or two attB 
sites. The absence of any change in site-selectivity in 
eukaryotic chromatin implies that the proposed con- 
formational differences between integrase bound to an 
attP site and to an attB site are robust to withstand 
any fortuitous protein-protein interactions arising with 
chromatin. 

The differences seen between the activities of the 
integrases in E. coli and in vertebrate cells on extra- 
chromosomal substrates on one hand and on substrates 
integrated into the genome on the other and the explan- 
ation for the site damage seen in the integration reactions 
both suggest that chromatin and other DNA binding pro- 
teins are important factors in limiting the activity of site- 
specific recombinases in vertebrate cells. It may therefore 
be of value in future experiments to determine how the 
activity of the integrases vary according to the pos- 
ition of the target sites in the genome. 



Methods 

Bacterial strains, plasmids and molecular biology 

E. coli K12 DH5a was used as a general cloning host 
and propagated on LB medium containing appropriate 
supplements or antibiotics (ampicillin (100 ug/ml), 
chloramphenicol (50 ug/ml), X-gal (120 ug/ml), IPTG 
(40 ug/ml). E. coli BL21 (DE3) was used as a protein 
over-expression host. To construct the integrase ex- 
pression plasmids, the integrase genes were amplified 
by PCR from either phage templates or from plasmids 
and inserted into pET21a vector (Novagen) using ei- 
ther the In-Fusion cloning system (Clontech) or T4 
ligase and compatible restriction sites. The native se- 
quences were used as the templates for the amplifica- 
tion of all integrase genes except for cpC31, A118, FC1 
and ipK38 for which templates were derived from synthetic, 
codon optimised forms (Genescript). Each integrase gene 
was modified to include a StrepII tag at the 5' end and a 
nuclear localisation sequence (NLS) at the 3 ' end. Reporter 
plasmids containing cognate attP and attB sites for all the 
integrases were cloned into pACYC184. PCR was used to 
amplify the lacZa gene using forward and reverse primers 
that contained the attB and attP sites (both approximately 
50 bp in length) in head-to-tail orientation (Additional 
file 1: Table SI). All of the constructed plasmids were 
verified by sequencing (Dundee Sequencing Service). 
The sequences of all PCR plasmids and primers used 
in the bacterial work are listed in Additional file 1: 
Table SI and Additional file 1: Table S2. 

Recombination assays in E. coli 

The activities of the cloned integrases in E. coli were 
assessed by two assays, which relied on different expres- 
sion regimes for the integrases. The expression vector 
pET21a has a T7 RNA polymerase promoter to drive the 
transcription of the integrase genes and this was used to 
ensure expression of the integrase genes in the E. coli 
host BL21(DE3). In the absence of T7 RNA polymerase, 
as is the case in E. coli DH5a, expression of integrase is 
dependent on the recognition of the T7 promoter by 
host RNA pol ymerase, which will result in lower 
expression. 

DH5a cells containing the attB/attP reporter plasmids 
were transformed with the appropriate integrase expres- 
sion plasmid and plated out on LB agar plates containing 
ampicillin (100 ug/ml), chloramphenicol (50 ug/ml), X-gal 
(120 ug/ml), IPTG (40 ug/ml) and incubated overnight at 
37°C Recombination between the att sites deletes the 
lacZa gene from the reporter plasmid leading to white 
colonies and therefore recombinants (white colonies) were 
scored amongst a background of non-recombinants (blue 
colonies). The transformation plates derived from several 
integrases gave rise to only blue or light blue colonies. 
Restreaking of these colonies was performed to determine 
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if white colonies (containing recombinant reporter plas- 
mids) could segregate. 

BL21(DE3) cells containing the attBlattP reporter plas- 
mids were transformed with the appropriate integrase 
expression plasmid and plated out on LB agar plates 
containing ampicillin and chloramphenicol. After an 
overnight incubation a single colony was picked and 
grown in 5 ml 2YT containing ampicillin and chloram- 
phenicol for 5 h at 37°C. The cells were induced with 
IPTG and grown overnight at 20°C after which 1 ml of 
this culture was used to prepare plasmid and the pellet 
from 4 ml was used in Western blotting to estimate 
the expression of integrase. Plasmids extracted from 
the overnight culture were used to transform DH5a 
cells and colonies growing on plates containing chlor- 
amphenicol, X-gal and IPTG after overnight at 37°C 
were examined. The proportion of recombinant versus 
non-recombinant plasmids was scored from the pro- 
portion of white versus blue transformants. PCR was 
carried out on 5 individual colonies for each integrase 
used in the assay to amplify and sequence the attL 
sites, remaining in the recombinant plasmids. Cell pel- 
lets from the overnight cultures were resuspended in 
LB, incubated in SDS sample buffer and the proteins 
separated on a 4-12% SDS gel (Expedeon, UK). The 
integrase protein was detected by Western blotting 
using a monoclonal antibody against the StrepII tag 
conjugated to horseradish peroxidise (IBA, Germany). 
The amount of protein present was determined by 
comparing the intensities of the bands from the Western 
blotting to those from a dilution series of purified 
StrepII-tagged (pC31 integrase. The intensity of the 
bands was determined using ImageJ (NIH) and the 
results were expressed as ug of total protein per ml 
culture. 

In vitro recombination assay 

Integrase recombination activity was assayed in vitro 
using substrates in which the attachment sites were 
present either isolated from other attachment sites or 
within the arrays containing the attachment sites for all 
15 integrases of interest. The combination of substrates 
was as follows: pRT702 (<pC31 attP site) and pRT600 
(cpC31 attB site), pRT702 and p\JC57_attB array, 
p\JC57_attP array and pRT600, p\JC57_attP array 
and pCCAG_aftB array, pCCAG_attP array and pUC57_attB 
array. The substrates were incubated in recombination buffer 
(10 mM Tris pH 7.5, 100 mM NaCl, 5 mM DTT, 5 mM 
spermidine, 4.5% glycerol, 0.5 mg/ml BSA) and cpC31 
integrase (0 - 700 nM) for 1 h at 30°C [26] After heat in- 
activation at 80° C for 10 min the reaction was digested 
with Hindlll or BamHI and the recombination molecules 
detected by gel electrophoresis. Gel images were analysed 
using ImageJ (NIH); the intensity of the bands was 



determined (after subtraction of base line intensities) and 
used to quantify the depletion of substrates and appear- 
ance of products. 

Human cell culture 

Human HT1080 fibrosarcoma cells were grown as de- 
scribed previously except that RPMI 1640 rather than 
Dulbecco's modified Eagles Medium was used as the un- 
supplemented medium. G418, Hygromycin and gancyclovir 
(Invitrogen) were used at 400 mg/ml, 100 mg/ml and 20 
uM respectively for selection. Cells were transfected with 
DNA either by electroporation using a BTX ECM 630 at 
400 V, 50 D. and 250 uF or using Lipofectamine (Invitrogen) 
as recommended by the manufacturers. We used electro- 
poration with 500 ug of linearized DNA and zeocin selec- 
tion for the stable introduction of the integrase expression 
constructs and lipofection of 1.6 ug of closed circular DNA 
for the transient introduction of either substrate or integrase 
expression construct. 

ES cell manipulation and analysis 

Targeting vector construction and gene targeting 

The insertion of the attP-Hygro-TK-attP array at the 
ROSA26 locus was performed by gene targeting in 
E14tg2a ES cells (Additional file 1: Figure SI). A targeting 
vector for the ROSA26 locus, pRosa26.10 [27] was adapted 
by the insertion of an AscI-Nsil-SacII polylinker, made by 
oligonucleotide annealing, into the AscI and SacII sites 3' 
and 5' of the homology arms, creating plasmid pRosa26- 
PL. The final targeting vector, pRosa26-PHB containing 
the attP-Hygro-TK-attB arrays, was constructed by 
inserting the arrays from pBSattPCCAGHyTKattB into 
pROSA26-PL via the 5' AscI and the 3' Nsil sites. 

The targeting vector was linearized by Xhol digestion 
and electroporated into 1x10 E14tg2a cells at 500 V, 
3 uF. Cells were plated on gelatin and recombinant 
clones were recovered by selection in Hygromycin 75 ug/ml. 
Targeted clones were identified by long range PCR screening 
using primers 5 ' -GG CACTACTGTGTTGGCGGA-3 ' 
and 5 '-GGCCAGCTTATCGATACCGT-3 ' for the 5' end; 
5 -AGCGAGGGCTCAGTTGGGCTGTTT-3 ' and 5 -CTC 
AGTGGCTCAACAACACTTGGTCA-3 ' for the 3' end. 
Single copy integration events were confirmed by Southern 
blotting using an EcoRV digest and an internal Hygromycin 
probe. 

Transient assays of integrase-mediated deletion in ES cells 

1 x 10 6 attP-Hygro-TK-attB ES cells were electroporated 
with 5 ug of the integrase vectors and 200 ng of a con- 
trol plasmid containing a functional neomycin cassette 
to control for transfection efficiency. Electroporation was 
performed using the Neon transfection system (Invitrogen) 
(3 x 1400 V, 10 ms). Cells were plated on 2 wells of a 6 well 
plate and selected in either 3 uM Gancyclovir or 350 ug/ul 



Xu et al. BMC Biotechnology 2013, 13:87 
http://www.biomedcentral.com/1472-6750/13/87 



Page 16 of 17 



G418. On the 8 4 day of selection resistant colonies were 
stained in methylene blue and counted. Deletion efficien- 
cies were calculated from the number of gancycolvir resist- 
ant colonies normalised against transfection efficiency by 
assessing the number of G418 resistant clones in the rep- 
lica plating. All comparison transfection experiments were 
repeated twice using 2 independently targeted ES cell 
clones. 

Individual gancyclovir resistant colonies were picked and 
expanded for the preparation of genomic DNA. PCR 
analysis using primers del-Rosa-Fl (5' GATCGAGGTGCC 
CCAACTGGGGTAACCT TT-3 ) and del-Rosa-Rl (5 -C 
CGCGGATGCATAGCGGATAACAATTTCAC-3 ') which 
bind at the very 5 ' and 3 ' of the integrated aftP-Hygro-TK- 
attB array and amplify across the expected deletion events 
(Additional file 1: Figure SI), was performed to confirm the 
deletion event had occurred. PCR products were sequences 
to assess the nature and position of the attP x attB recom- 
bination and the sequences of the resulting attR sites. 
Gancyclovir resistant clones that failed to yield an amplicon 
using this PCR strategy were further investigated for aber- 
rant recombination events using primers which amplify the 
hygromycin sequence (5 - GAAGAATCTCGTGCTTTCA 
GCTTCGATG-3 '; 5 - AATGACCGCTGTTATGCGGCC 
ATTG-3') and the thymidine kinase sequence (5 -TCTG 
GACCGATGGCTG TGTA-3'; 5 ' -AGACGTGCATGGA 
ACGGAGG-3). 

Additional file 
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